57 results found.
Written
Corpus,
Language Type:
Bilingual
Languages:
English Romanian
Availability:
Freely Available
License:
Size:
None Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Arya D. McCarthy | wmt16 data | /N |
Documentation:
None
Written
Corpus,
Language Type:
Multilingual
Languages:
Afrikaans Albanian Amharic Arabic Aragonese Armenian Assamese Azerbaijani Basque Belarusian Bengali Bosnian Breton Bulgarian Burmese Catalan Central Khmer Chinese Croatian Czech Danish Dutch Dzongkha English Esperanto Estonian Finnish French Gaelic Galician Georgian German Greek Gujarati Hausa Hebrew Hindi Hungarian Icelandic Igbo Indonesian Irish Italian Japanese Kannada Kazakh Kinyarwanda Korean Kurdish Kyrgyz Latvian Limburgan Lithuanian Macedonian Malagasy Malay Malayalam Maltese Marathi Mongolian Nepali Northern Sami Norwegian Norwegian Bokmål Norwegian Nynorsk Occitan Oriya Panjabi Pashto Persian Polish Portuguese Romanian Russian Serbian Serbo-Croatian Sinhala Slovak Slovenian Spanish Swedish Tajik Tamil Tatar Telugu Thai Turkish Turkmen Uighur Ukrainian Urdu Uzbek Vietnamese Walloon Welsh Western Frisian Xhosa Yiddish Yoruba Zulu
Availability:
Freely Available
License:
Size:
55 million sentences Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Biao Zhang | the open parallel corpus (OPUS) | /N |
Documentation:
None
Not Applicable
Contextualsed word embeddings,
Language Type:
Monolingual
Languages:
Ancient Arabic Basque Bokmål Bulgarian Catalan Chinese Church Croatian Czech Danish Dutch English Estonian Finnish French Galician German Greek Hebrew Hindi Hungarian Indonesian Irish Italian Japanese Korean Latin Latvian Norwegian Nynorsk Old Persian Polish Portuguese Romanian Russian Simplified Chinese Slavonic Slovak Slovene Spanish Swedish Turkish Ukrainian Urdu Uyghur Vietnamese
Availability:
Freely Available
License:
none
Size:
18.4 GByte Production Status:
Existing-used
Use:
Parsing and Tagging
-
Paper title:Treebank Embedding Vectors for Out-of-domain Dependency Parsing
-
Paper track:Short/Syntax: Tagging, Chunking and Parsing
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joachim Wagner | Elmo For Many Languages | /N |
Documentation:
https://www.aclweb.org/anthology/K18-2005/
Written
Corpus,
Language Type:
Multilingual
Languages:
Chinese Czech English Finnish German Latvian Romanian Russian Turkish
Availability:
Freely Available
License:
Size:
3.9 MByte Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:Automatic Machine Translation Evaluation using Source Language Inputs and Cross-lingual Language Model
-
Paper track:Short/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Kosuke Takahashi | WMT18 metrics shared task data | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Portuguese Romanian Russian Spanish
Availability:
Freely Available
License:
CreativeCommons
Size:
500 hours Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Adapting Transformer to End-to-End Spoken Language Translation
-
Paper track:12.1 Spoken machine translation/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mattia A. Di Gangi | MuST-C | /N |
Documentation:
None
,
Language Type:
Monolingual
Languages:
Romanian
Availability:
License:
Size:
None Production Status:
Use:
-
Paper title:Ongoing phonologization of word-final voicing alternations in two Romance languages: Romanian and French
-
Paper track:2.1 Phonetics and phonology/Oral Presentation
-
Paper status:Accept - Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Mathilde Hutin | Corpus Quaero | /N |
Documentation:
None
Speech/Written
Corpus,
Language Type:
Multilingual
Languages:
Dutch English French German Italian Portuguese Romanian Spanish
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-NoDerivs 4.0 License
Size:
None Production Status:
Newly created-finished
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:MuST-Cinema: a Speech-to-Subtitles corpus
-
Paper track:Multimodality/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Alina Karakanta | MuSt-Cinema | /N |
Documentation:
Documentation publicly available in English
Written
Corpus,
Language Type:
Bilingual
Languages:
English Romanian
Availability:
Freely Available
License:
Size:
1 GByte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation
-
Paper track:Long/Machine Translation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Chenze Shao | wmt16 | /N |
Documentation:
None
Written
Corpus,
Language Type:
Monolingual
Languages:
Romanian
Availability:
Freely Available
License:
GNU
Size:
30 MByte Production Status:
Newly created-finished
Use:
Language Identification
-
Paper title:MOROCO: The Moldavian and Romanian Dialectal Corpus
-
Paper track:Long/Resources and Evaluation
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Radu Tudor Ionescu | MOROCO | /N |
Documentation:
English
Written
Corpus,
Language Type:
Multilingual
Languages:
English French German Romanian
Availability:
Freely Available
License:
Size:
None GByte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Filtering Back-Translated Data in Unsupervised Neural Machine Translation
-
Paper track:Short paper/
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Jyotsana Khatri | WMT data | /N |
Documentation:
None




